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A review of the literature indicated that 
tests are inadeauate for accurate assessment of 
discrimination ability in children. Hiaher 
ation tests than those which would he predicted 
measures seemed to implicate task variables, 
lty, repeated contrast test pairs, consisting 
which the same phonemic contrast occurs twi 
ultiple restino sessions were also employed 
difficulty. Two experiments were conduced to 
of task variables. The first experiment invol 
syllables in an A-B-X paradigm. The subjects 
Kindergarten children. The second experiment involved 
eal word items from the Wepman Test of Auditory 
on and the sublects were the v lrst Grade group from 
and a new r irst Grade group of 12. Experiment T showed 
strast pairs easier +o discriminate than initial or final 
rs and that results were poorer on the first day. In 
I, first day performance was also significantly poorer 
er davs. It is suggested that repeat®! testing is 
r young children and that repeated contrast pairs may 
ans of obtaining more complete assessment of phonological 
on ability in children. (Author/iR) 
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STATEMENT OF FOCUS 



The Wisconsin Research and Development Center for Cognitive Learning 
focuses on contributing to a better understanding of cognitive learning by 
children ond youth and to the improvement of related educational practices. 

The strategy for research and development is comprehensive. It includes 
basic research to generate new knowledge about the conditions and processes 
of learning and about the processes of instruction, and the subsequent devel- 
opment of research-based instructional materials, many of which are designed 
for use by teachers and others fur use by students. These materials are tested 
and refined in school settings. Throughout these operations behavioral scien- 
tists, curriculum experts, academic scholars, and school people interact, in- 
suring that the results of Center activities are based soundly on knowledge of 
subject matter and cognitive learning and that they are applied to the improve- 
ment of educa’.lonal practice. 

This Technical Report is from the Language Concepts and Cognitive Skills 
Related to the Acquisition of Literacy Project in Program 1. General objectives 
of the Program are to generate new knowledge »%bout concept learning and cog- 
nltlve skills, to synthesize existing knowledge, and to develop educational 
matetlals suggested by (he prior activities. Contributing to these Program 
objectives, this pro set’s basic goal is to determine the orocesses by which 
children aged 4 to 7 learn to read, examining the development of related cog- 
nitive and language skills, and to identify the specific reasons why many 
children fail to learn to read. Later studies will be conducted to find experi- 
mental techniques and tests foe optimlz'nq the acquisition of skills needed 
for learning to read, By-products of this research program include methodo- 
logical innovations in testing paradigms and measurement procedures; the 
present study is an example. 
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ABSTRACT 



A review of the literature led the present investigators to conclude that 
conventional tests are Inadequate for accurate assessment of phonological 
discrimination ability in children. Higher error rates on discrimination tests 
than those which would be predicted from articulation measures seemed to 
implicate task variables. To reduce task difficulty, the present investigators 
developed repeated contrast test pairs. Such pairs consist of CVC syllables 
in which the same phonemic contrast occurs twice (e.g, , Aob/ - /dod/). 
Multiple-testing ressions were also employed to reduce initial task difficulty. 

Two experiments were conducted to determine the effects of task varia- 
bles. In Experiment 1, nonsense syllables were used in an A-B-X paradigm. 
Subjects were 12 First Grade and 12 Kindergarten children. Mixed lists of 
repeated contrast, initial contrast, and final contrast pairs were tape re- 
corded and presented to each S, one list per day for 6 days. 

Experiment 11 involved using the real word Items from the Wepman Test 
of Auditory Discrimination. Subjects were the 12 First Grade Ss from Experi- 
ment I and 12 additional First Grade Ss. Testing was conducted following the 
instructions lor administration of the test. Four testing sessions were given 
each S. 

Tne results for Experiment I showed that repeated contrast pairs were 
easier to discriminate than either initial or final contrast pairs and that there 
was no difference between initial and final contrast pairs. Performance on 
Day 1 was significantly poorer than on all other days, and there was no d if - 
ference among the other $ days. 

For Experiment II, performance on Day 1 was also significantly poorer 
than on the other days. In addition, it was found that there was no dif f er- 
ence in performance for "new” and "old* Ss. 

The results have two major implications: First, repeated test.ng Is a 
necessity for young children. Second, repeated contrast pairs may provide 
a means of obtaining a more complete assessment of phonotogical discrim- 
ination ability in children. 
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INTRODUCTION 



BACKGROUND LITERATURE 

Discrimination of speech sounds, in some 
sense, must precede the production of speech 
sounds. If a speaker can consistently articu- 
late a given speech sound, he must be obis to 
discriminate that sound from other sounds. 
Implicitly, investigators have assumed that 
phonological discrimination tasks measure the 
skills prerequisite to articulation. On the 
basis of this assumption, researchers have 
attempted to isolate discriminatory deficiencies 
and show that these deficiencies are related to 
articulatory problems. The purpose of this 
study is to test the validity of the instruments 
used in assessing speech sound discrimination 
ability and to develop ways of improving tradi- 
tional tests. 

The study of discrimination skills always 
involves inference. A task intervenes between 
behavior ana observation. Consequently, it 
would seem that the task is a crucial factor in 
speech discrimination experimentation . A re- 
view of past research in which measurements 
of speech discrimination were undertaken is in 
order. The following survey concerns three 
common speech sound discrimination t< sts, the 
Travis and Rasmus Speech Sound Discrimination 
Test, the Templin Speech Sound Discrimination 
Test, and the YVepman Test of Auditory Dis- 
crimination. The bulk of research employing 
these tests has involved nttempts to relate 
*ound discrimination abihty to articulation 
ability. Because such attempts are related to 
theoretical assumptions about discrimination 
ability, the research will be discussed in some 
detail. 

Conventional Methods of Assessing 
Phonemic Discrimination 

The Travis and Rasmus Speech Sound Dis- 
crimination Test (1931) represents an early 



attempt to assess discrimination ability and 
relate it to articulation ability. This test con- 
sists of 331 contrasting pairs (e.g., /ta/ - 
/da/) and 3S non-contrasting pairs (e.g., 

/ta/ - /ta/). Subjects, both adults and chil- 
dren in this case, were asked to make same or 
different Judgments in response to the pairs as 
they were presented orally. Three hundred 
pairs Involve consonants, and 66 pairs involve 
single vowels, All consonant contrasts occur 
in consonant-vowel (CV) nonsense syllables, 
the vowel being /a/. The practical advantage 
of a test of this size is questionable. Since 
the Travis and Rasmus Test, researchers have 
only used "minimally" different speech sounds 
in contrast pairs. 

Travis and Rasmus (1931) compared Ss with 
normal articulation and Js who had mild-to- 
severe functional disorders of articulation. 

Four age levels were studied: adults, Fourth 
and Fifth Graders, oecond and Third Graders, 
and Junior primary and First Grade pupils. The 
investigators concluded that Ss with normal 
articulation are superior in speech sound dis- 
crimination skill to articulatory defectives at 
all age levels. The important point, for our 
purposes, is not the conclusion itself, but the 
data on which the conclusion w&$ based. Con- 
trol Ss with normal articulation scores In the 
Junior primary and First Grade group (n * SO) 
made an average of 8. 2 errc f s on the discrim- 
ination test. Since less than 10& of the con- 
trast pairs included on the Travis and Rasmus 
Test are what are know* as "minimal" contrasts 
(i.e., contrasts involving only single-feature 
changes), this error rate seems surprisingly 
high in comparison to what Is known about 
articulation abilities for this age level (Venezky 
& Calfee, 1968). Throughout the ensuing sur- 
vey, an attempt will be made to emphasize the 
si 2 e of the error rates obtained in speech sound 
discrimination measures for young children. 

The Travis and Rasmus Speech Sound Dis- 
crimination Test was one of a battery of tests 

1 
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used by Hall (1938). Her purpose, like that 
of Travis and Rasmus, was to compare speech 
defectives with normal speakers. Subjects 
were university freshmen and elementary 
school children. In direct opposition to the 
findings of Travis and Rasmus, Hall found that 
functional articulatory speech defectives, at 
both age levels, showed no inferiority to 
normal speakers on any of her mo, sures. The 
elementary school control Ss (N -■ 64; age - 7 
through 13 years) In Hall's study made a mean 
of 15.10 rsrors on the discrimination test. 

The fact that this is nearly twice as large as 
the error rates found by Travis and Rasmus for 
Junior piimary and First Grade Control Ss Is 
strange. Just as strange is Hall's finding of 
an average of '8 .4 errors for the university 
freshmen control S s . This mean is almost 
identical to that of the junior primary and First 
Grade control Ss in the Travis and Rasmus 
study. Hall explains that this discrepancy 
may have arisen because of experimental 
group selection procedure, experimenter vari- 
ation, or, most importantly, because Travis 
and Rasmus tested control Ss in groups and 
experimental Ss in groups and experimental 
Ss individually, while both experimental and 
controls in the Hall study were tested In the 
same groups . 

The Travis and Rasmus Test was replaced 
when it became apparent that only minimally 
different speech sounds should be incorporated 
in discrimination tasks . Teinplin (1943), in 
an an attempt to produce a useable, "short 
test” of sound discrimination, devised a 70- 
item test consisting of SI contrasting pairs 
and 19 non -contrasting pairs, Although 
Templin's Test used nonsense syllables and 
relies on same-different Judgments, it is 
different, In several respects, from the Travis 
and Rasmus model. While Travis and Rasmus 
used only CV syllables to tost consonant dis- 
crimination, Templln used CV, VC, and VCV 
syllables. She also utilized three vowels 
and a diphthong in constructing the syllables 
(/l/, /e/, /o/, and /al /) . Furthermore, no 
vowel contrasts are included in the Templln 
Test. 

Subjects for tne Templln Test were 301 
pupils atteuJing Second through Sixth Grade, 
Subjects showed a mean of 18. 29 errors, or 
an overall error rate of 26%. The error rate 
may represent only a lower limit, since 
Templln seems to have lumped together both 
"same" and •different" errors in the figures 
she reported. 1 In any case, the error rate is 



*it has been shown that performance on 
"same" pairs is significantly better than. pet- 



well above what the present authors feel 
would be an accurate picture of sound dis- 
crimination abilities of this age grr < .. The 
short test discussed here and a shorter ver- 
sion consisting of 50 same and different Items 
(Templln, 1957) have been utilized by several 
investigators who were concerned with the 
relation between articulation abilities and 
phonemic discrimination ablitles. 

Kronvall and Diehl (1954) matched a group 
of 6- to 9-year-old speech defectives (N = 30) 
with a normal group (N = 30). Using the 70- 
item Templin Speech Sound Discrimination 
Test, they found that the speech defective 
group made significantly more discrimination 
errors. Cohen and Diehl (1963) replicated 
the Kronvall and Diehl study and confirmed 
the results. These two studies yielded 12.37 
and 12.13 mean, errors, respectively for con- 
trol Ss. Both figures constitute a 17% overall 
error rate. 

Sherman and Gelth (1967) reversed the 
normal procedure in selecting Ss for their 
study. They first administered the Templin 
Speech Sound Discrimination Test to 529 Kin- 
dergarten pupils; then they selected 18 high- 
scoring Ss and 18 low-scoring Ss in order to 
compare the two groups on an articulation 
test (Templln-Dar)ey Picture Articulation Test). 
The investigators used this procedure because 
they felt that other studies, which did not 
find significant differences between defec- 
tives and normals, failed to use appropriate 
selection procedures. Subjects in previous 
studies were chosen on the basis of their 
articulation deficiencies and consequently 
were heterogeneous with respect to the 
etiology of the speech disorder. Results of 
the Sherman and Gelth study showed a sig- 
nificant relationship between high discrimina- 
tion errors and high articulation errors. The 
Investigators concluded that deficient dis- 
crimination skill is causally related to poor 
articulation. No overall error rates or item 
analyses from the sound discrimination test 
are available for this study. 

The Wepman Auditory Discrimination Test 
(1 9SS) has also been relied upon as an instru- 
ment in correlational studies. Wepman* s Test 
contains 40 real-word pairs— 10 "same" pa*rs 
and 30 "different" pairs. Only errors on 



forr.ance on "different" pairs (8riere, 1967). 
If "same" pairs are included in calculating 
error rates, the estimate will be lower than 
estimates based solely on "different" pairs. 
Thus, if Templin counted all errors, a cor- 
rected estimate (based only on •different'’ 
pairs) would be greater than the 26% cited. 
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"different" pairs are scored. "Same" pairs 
were included to serve as a check on the 
validity of the test (Wepmart, 1960). Both 
members of a contrasting pair are equated for 
frequency of occurrence according to the 
Thorndike-Lorge count (1944). All consonant 
contrasts on the test constitute changes in 
place of articulation only. In addition to con- 
sonant contrasts, four vowel contrasts appear 
on each of the two test forms available. 

Wepman (I9 60), in a preliminary report on the 
findings of studies using his test, concluded 
that there was a definite relation between poor 
articulation and poor discrimination but no 
data were given. 

Using the Wepman To st, Frins (1963) ap- 
proached the traditional question in a new and 
Interesting way. He argued that attempts to 
correlate a gross articulation score with an 
overall speech sound discrimination score 
were unfruitful; rather, if a relation between 
the two skills exists, then a relation between 
specific deviations of speech sound production 
and phonemic discrimination should be demon- 
strable . 

Prins obtained articulation measures on 
First Grade pupils, and selected experimental 
Ss on the basis of more than three errors. 

Each SF s errors were categorized by type. 

Then the Wepman Test of Auditory Discrimina- 
tion was administered. It was found that 
children in the experimental group whose 
articulation errors were phonemic substitutions 
involving the place feature tended to make 
many errors on the Wepman Test. (Recall that 
Wepman Test items all involve place contrast.) 
Those children in the experimental g-oup whose 
substitution errors differed greatly from the 
target phoneme made fewer errors on the Wep- 
man Test. There was no significant difference 
between the Wepman scores of the experimental 
group ahd those of a control group chosen on 
the basis of few articulatory errors . The con- 
trol Ss showed a mean of 5.58 errors on the 
Wepman Auditory Discrimination Test; this 
constitutes an overall error rate of 18.6%. 

Attempts to Improve 
Conventional Testing 

No speech sound discrimination error rate 
for normal groups reported In the literature was 
less than the 17% reported by Kronvall and 
Diehl (1954). Since the assumption was made 
that speech sound discrimination tests should 
be measuring a skill prerequisite to articula- 
tion, the error rates cited are unusually high. 

It would seem reasonable to conclude that con- 
ventional tests are not measuring discrimina- 



tion skill alone. Thus, the validity of these 
tests is questionable. A research program 
was initiated at the Wisconsin Research and 
Development Center to isolate variables af- 
fecting the measurement of sound discrimina- 
tion. Four experiments are reported as repre- 
sentative of the experiments conducted. De- 
tails of methodology have been omitted from 
the following account. The Appendix is a 
summary of the procedures employed in each 
of the experiments. In general, each experi- 
ment constituted an attempt to produce a 
speech sound discrimination test that would 
allow valid comparisons of discrimination 
and articulation errors. 

For the first experiment, the instrument 
used was designed to avoid the overt same- 
different procedure employed by Templin (1943) 
and Wepman (1958). It was felt that some Ss 
in this age group (5-6 years) were unable to 
comprehend the relevant dimensions required 
in making same-different Judgments. Further- 
more, the A-X paradigm (l.e., same-different 
procedure) has recently been criticized by 
Briere (1967). He shows evidence that "an 
A-X paradigm In language perception might 
simply be a measure of the SJ s bias to respond 
'same' or ’different'." Also, error rates for 
"same" pairs are significantly lower than error 
rates for "different" pairs* 

CVC nonsense syllables were employed In 
an A-B-X paradigm 2 in an attempt to control 
for differential familiarity of test Items. All 
contrasts employed in this test were initial 
consonant contrasts. Subjects showed an 
average of 5.7 errors. This constitutes an 
overall error rate of 18.9%. 

The second experiment was, in purpose 
and design, similar to the first. Final con- 
sonant contrasts In CVC nonsense syllables 
were employed. Subjects made an average of 
7 . 3 errors , or an overall error rate uf 24.5%. 
Obviously, the goal of deflating error rates 
was not realized In the first two experiments. 
In the third experiment, attention was turned 
to the relationship between articulation and 
discrimination . 

An imitative articulation test similar to 
Templin’s (1947) was devised and adminis- 
tered to Kindergarten pupils. An experimental 
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An A-B-X paradigm consists of presenting 
word A, then word B r and finally the question 
"who said X?" , where X is either A or B. In 
an A-X paradigm, word A is presented, fol- 
lowed by word X, where X is either A or an- 
other word (B) . This is the traditional same- 
different paradigm. 
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group was formed from Ss making more than 14 
articulatory errors. From the remaining Ss, an 
equal number of control Ss was randomly se- 
lected. The two groups were then given a 
phonemic discrimination test consisting of 
initial and final contrasts. Both gioups showed 
a mean of 6.3 errors, or an overall error rate of 
31 .6%. Since the error rates for both groups 
were identical, no further analyses were con- 
ducted. It was felt that such a high error rate 
on what is presumed to be a simple task left 
only the task ltseir t.o be blamed. A retesting 
of a randomly selected portion of the experi- 
mental Ss one week Jater reinforced this idea. 
Subjects were inconsistent in the type of errors 
made. 

A final experiment in this series, reported 
by Skeel, Calfee, and Venezky (1969), tested 
fricative discrimination in preschool children. 
Results showed, as expected, that Ss in this 
age group (3 years 11 months to 5 years 1 
month) had considerable difficulty in discrim- 
inating fricatives from one another. A 28% 
overall error rate was obtained. Subjects were 
also tested on articulation of the same fricative 
set. The correlation between poor articulation 
ability and poor discrimination was significant 
(8 = .62, £ < .01). Item analysis , however, 
showed that articulation error patterns differed 
from discrimination error patterns. This dis- 
crepancy points out the danger of using gross 
scores in correlating articulation and speech 
discrimination ability. If speech sound dis- 
crimination eirors are not predictable from a 
S' 1 s articulation errors, then the correlation if 
obtained, is not meaningful. It is possible 
that both articulation ability and phonemic 
discrimination ability are related to some third 
variable (e.g,, general language processing 
ability) . 

The above studies represent a sample of 
attempts to manipulate variables in speech 
discrimination tasks in order to reduce error 
rates. The attempts have failed to show any 
improvement over previous experimentation in 
this area. The variables that were manipulated 
:n the experiments so far are not crucial in af- 
fecting task difficulty. If the assumption that 
task variables are interfering with accurate 
measurement of sound discrimination in chil- 
dren is true, then the identification of these 
variables is vital. 



THE PRESENT STUDY 

Assuming that, for young children, there is 
an inherent difficulty in normal phonemic dis- 
crimination tasks, major modifications in de- 
sign and administration seem necessary. 



Phonemic discrimination tasks usually involve 
minimally different pairs of syllables (e.g., 
/mom/ - /nom/) , One way of making the task 
less difficult would be to increase the dls- 
cr.'mlnabillty of items in a pair. It Is desirable, 
though, to control closely the contrasts so that 
specific inferences can be made about the cues 
Ss are using In discrimination. 

To solve the problem, we developed test 
Items called repeated contrast pairs, A re- 
peated contrast pair consists of two CVC syl- 
lables in which the same phonemic contrast 
occurs twice. For example, /p ee p/-/t a& fc/ Is 
a repeated contrast pair in which the phonemic 
contrast /p/-/t/ occurs in both initial and 
final positions. Subjects must use the phone- 
mic information to discriminate correctly but 
they are given the same information twice. 

Vowel length has been found to vary as a 
function of the following consonant (Peterson 
& I.ehiste, 1960). Skeel, Calfee, and Venezky 
(1969) suggest that Ss might be using these 
varying durations as cues to the discrimination 
of contrasting elements In final position. They 
found that errors were higher on CV syllables 
than on VC syllables involving the same con- 
trasts. Thus, repeated contrast pairs might 
include vowel duration cues in addition to 
^ther phonemic cues. Peterson and Lehiste 
(1960) identified four categories of consonants 
within each of which vowel length is not dif- 
ferentially affected. The categories are given 
in Table 1 . For CVC syllables, the consonants 
within a given category have the same effect 
on the duration of the vowel. Choosing con- 
sonant contrasts from within these categories 
will eliminate potential vowel duration cues. 
Discrimination, then, can only be made on 
the basis of the contrasting consonant elements. 



Table 1 

Final Consonants Classed According to Their 
Influence on the Duration of the Preceding 
Vowel (from Peterson & Lehiste, 1960), 



Range of Representative 
Vowel Duration in Centl- 
seconds 


Consonants 


19-21 


/p, t, k, c/ 


26-28 


/f, e, s, s/ 


30-32 


/b, d, g, ra, n/ 


37-41 


/V, 2, 2 / 



4 



o 

ERJC 



v 



In a recent criticism of the use of an A-B-X 
paradigm in phonological testing (Briere, 1967), 
it was found that the A-B-A alternative produces 
significantly more errors than the A-B-B alterna- 
tive. In light of Briere' s additional criticism of 
the A-X paradigm, the choice of a suitable task 
involves a dilemma. One solution is to use the 
A-B-X paradigm, but carefully counterbalance 
presentations of all items so that contrasts occur 
equally in A-B-A and A-B-B instances* This 
procedure was adopted for the present study. 

Of the studies reported above, only Skeel, 
Oalfee, and Venezky (1969) involved training 
Ss in the discrimination task. In that study, 



no significant effect of practice was found but 
the 1-week intervals between sessions may 
have mitigated the effectiveness of the train, 
ing. For the present study, these were impor- 
tant factors to be considered. 

The specific hypotheses to be tested in the 
present study are: 1) that the use of repeated 
contrast items will yield lower error rates than 
conventional pairs, and 2) that multiple test- 
ing will produce more stable performance in 
a disci imination task of this sort. Both hy- 
potheses represent attempts to reduce task 
difficulty without sacrificing control over test 
procedures . 



II 

METHOD 



EXPERIMENT I 
Materials 

Using the categories given in Table 1, 21 
repeated contrast pairs (e.g., /bob/ - /dod/) 
were constructed. For each phonemic contrast 
used in a repeated contrast pair, a minimal 
initial pair (e.g., /bob/ -/dob/) and a mini- 
mal final contrast pair (>.g., /bob/ - /bod/) 
were t.lso constructed. Of the 21 contrasts, 

13 were composed of consonants which were 
drawn from the same vowel length category 
(see Table 1) . The remaining eight contrasts 
were voiced-voiceless pairs (e.g., /p ap/- 
/o se b/) drawn from between categories. A 
list of all test pairs is given in Table 2. 

From the pairs in Table 2, six lists were 
formed. Each pair appeared once in the first 
three lists and once in the second three- Each 
list was composed of 24 test pairs, 7 each of 
repeated, initial, and final contrasts in ran- 
domized orders and 3 real word pairs (e.g., 
car-dog) at the end of the list. The real word 
pairs served as control pairs, Voicing pairs 
were interspersed with controlled vowel length 
pairs in the lists . 

All items were recorded on an Ampex stereo 
tape recorder at the WHA Radio Studios on the 
UW campus. The speaker's dialect is best 
described as Upper Midwestern. 



Procedure 

An A-B-X paradigm was used. A ready sig- 
nal (hell) sounded on the left speaker, followed 
by item A of a test pair on the same speaker. 
Item B of the pair was presented on the right 
speaker. The question "Who saidX?" occurred 
on both speakers. The S's task was to match 
X to A or B by pointing to the appropriate speak- 
er. Informative feedback was then presented 
over the correct speaker: "I said X." The next 
trial was signaled by the bell. 



Tapes were played on an Ampex stereo tape 
recorder, Model 1100. The speakers were 
placed 6 feet apart and directly to the right 
and left of the S s . 

Subjects were tested six times, once a day 
for about 5 minutes per day. Lists were ar- 
ranged in counterbalanced sequences across 
Ss. 



Subjects 

Twelve Kindergarten and 1 2 First Grade Ss 
from a Madison public school participated. 

The mean ages for the two groups were 5.5 and 
6.5 years, respectively. 

Responses were recorded by E on prepared 
data sheets. 

EXPERIMENT I! 

Materials 

Test pairs used in this experiment were 
those from the WeDman Test of Auditory Dis- 
crimination (WTAD) (1958). Two lists were 
used. Form I and Form Ii of the WTAD. 

Each form contains 30 minimally contrast- 
ing real word pairs (l.e., "different" pairs) 
and 10 non-contrasting real word pairs (i.e., 
"same" pairs). There are 13 consonant con- 
trasts involving change in placf; of aiticula- 
tion alone and 4 vowel contrasts. Each of the 
13 consonant contrasts appears once in initial 
position and once in final position on each form. 

Each form consists of 26 consonant contrasts, 
4 vowel contrasts, and 10 non-contrasting pairs, 
for a total of 40 pairs . 



Procedure 

Presentation of the items conformed to the 
instructions given for administration of the 



Table 2 

Complete List of Test Pairs Used for Consonant Discrimination Task 



Minimal-pair Minimal-pair 

Repeated Contrast Pairs (initial) (final) 



p*p 


- 


t«t 


p*p - 


tap 


pap 


- 


pat 


tat 


- 


kak 


tat - 


kat 


tat 


- 


tak 


test 


- 


V V 

C£3C 


V 

tec - 




V 

cat 


- 


Sa* 


faf 


- 


9*9 


faf - 


e«f 


faf 


- 


f®9 


0as8 


- 


8338 


03SS - 


6X8 


s®9 


- 


sas 


ea ss 


- 


V V 

saes 


8*8 - 


sas 


6*8 


- 


sas 


bob 


- 


dod 


bob - 


dob 


bob 


- 


bod 


dod 


- 




dog - 


g°g 


god 


- 


gog 


dod 


- 




dod - 


3od 


dod 


- 


do^ 


mom 


- 


non 


mom - 


nom 


mom 


- 


mon 


vov 


- 




vov - 


3ov 


VOV 


- 


vo3 


3o3 


- 


zoz 


3o* - 


203 


3o3 


- 


5oz 


70 Z 


- 


V v 

zoz 


zoz - 


V 

zoz 


zoz 


- 


V 

ZOZ 


P*P 


- 


basb 


pap - 


bap 


pap 


- 


pab 


tot 


- 


dod 


tod - 


dod 


•dot 


- 


dod 


ksek 


- 


g®8 


kag - 


g*s 


gak 


- 


gseg 


faef 


- 


vaev 


faf ~ 


Vffif 


faf 


• 


faev 


0350 


- 




0350 - 


3*6 


6x9 


- 


0a3 


SOS 


- 


zoz 


SOS - 


zos 


80S 


- 


soz 


SOS 


- 


V V 

ZOZ 


v * 

80S - 


ZOS 


* * 
80S 


- 


so z 


coc 


- 


Wi 


V V 

coc - 


joc 


V V 

COC 


- 


co} 



WTAD. From a position behind Ss, E read 
each pair aloud. Subjects were required to 
respond "yes" if the items in a given pair 
were the same and "no" if they were differ- 
ent. Errors were recorded on prepared data 
sheets. The E was the same speaker who 
prepared the tapes for use in Experiment I. 



Subjects 

Two groups of 1 2 First Grade Ss took part. 
The mean age for Group I was 6.8 years; for 
Group II the mean age was 6.9 years. Mean 



percentile scores on the Metropolitan Readi- 
ness Test were 76.8 for Group I and 81 . 5 for 
Group II. 

Each S in the first group was tested four 
times, once a day in individual sessions. 
Presentation of Lists I and II was counterbal- 
anced over Ss with one restriction: On Day 1 , 
a particular S received either Form I or II. 
That S^ then received the remaining Form on 
Day 2. The procedure was repeated on Days 
3 and 4. 

Group II consisted of the same 1 2 S s who 
participated in Experiment I. They were 
tested only once, with Ss assigned randomly, 
six to Form I, six to Form II. 
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RESULTS 



EXPERIMENT I 

Overall, the error rate for Experiment I was 
13.9%. For the Kindergarten Ss , the error rate 
was 17,7%, while it was 10.2% for the First 
Grade Ss . When divided by item types, final 
contrast pairs had an error rate of 16.6%, 
initial contrast pairs had an error rate of 15%, 
and repeated contrast pairs showed an error 
rate of 9.9%. 

Errors were tabulated for each £ for all 6 
days. Two preliminary analyses were per- 
formed. First, the errors on the 13 controlled 
vowel-length pairs were analyzed in a repeated 
measures analysis of variance, S_s nested 
within grades and crossed with contrast types 
(repeated, final, or initial) and days. Correc- 
tions for repeated measures were made, where 
appropriate, by the Geisser and Greenhouse 
method (1958) There was a significant effect 
of contrast types, F(1 , 22) =6.83,£< .0 25 . 
Subsequent tests by the Newman-Keuls m< thod 
showed a significantly lower mean number of 
errors on repeated contrast pairs than on either 
type of single contrast pair (£< .05). There 
was no difference in mean errors between 
minimal final and initial pairs (p > .05). The 
data for the eight voicing contrasts were ana- 
lyzed in the same manner. Again, the effect 
of items was significant, F(l, 22) = 5.29, 

£< .05. Subsequent tests again showed a 
lower mean number of errors for repeated con- 
trast pairs than for either type of single con- 
trast pair (£ < .05). Laterally, no difference 
between final and initial contrast pairs ex- 
isted, as the means were identical. In both 
analyses, no interactions were significant, 
so for the final analysis the errors for all items 
combined were analyzed by a similar repeated 
measures design, All main effects were sig- 
nificant: days (F (1 , 2 2) = 1 1 , 32, p < .01); 
contrast types (F (1 , 2 2) =13.21, £ < .01); and 
grades IF(1 , 2 2) = 5. 23, £<.05). None of 
the interactions reached significance. 



Subsequent tests by the Newman-Keuls 
method showed that the mean number of errors 
per S on repeated contrast pairs (4,2) was sig- 
nificantly smaller than either the mean errors 
per on initial pairs (6.4), or the mean errors 
on final pairs (7.0), £ < . 01 . There was no 
difference between the means for initial and 
final pairs (R > .05). 

Mean errors per S as a function of days is 
plotted in Fig. 1. Because there is no appro- 
priate statistic for evaluating the differences 
between levels of an ordered variable, the 
Newman-Keuls test was used. Only the dif- 
ferences between Day 1 and all other days 
were significant (£ < .01). 

Because Day 1 performance Is significantly 
worse than the performance on Days 2-6, error 
rates were calculated for the last 5 days, 
Overall, the error rate is 1 2,1%. For Kinder- 
garten. the error rate is 15.7%; fer first Grad- 
ers, it is 8.5%, For final, initial and re- 
peated contrast pairs, the error rates are 
14.3%, 14,0%, and 8,0%, respectively. 

Table 3 shows error rates separated by 
voicing contrasts versus others, In ail cases, 
voicing contrasts show a slightly lower error 
rate. 

Separate item analyses were also performed. 
Error rates separated by item types for all con- 
trasts are given in Table 4. Three contrasts, 
/{/ - /e/, /v/ - /5/ f and /m/ - /n/, account 
for 43% of all errors on controlled vowel-length 
pairs . 

One final analysis was conducted on the 
data from Experiment I. The A-B-A arid A-B-B 
errors for each S_ for each day were analyzed. 
Table 5 gives the number of A-B-A and A-B-B 
errors as a function of days and grades. A 
repeated measures analysis of variance showed 
no significant difference between the; mean 
errors on A-B-A ar.d A-B-B (F (1 , 22) » 3 • 35 , 
£>.05). There was a significant difference 
between the means for the interaction of A-B-A 
and A-B-B errors with days (F(l, 22) - 5.43 , 
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Figure 1. Mean Errors per S as a Function of 
Days and Grade for All Test Items 
Combined 



£< .05). Subsequent^ tests between the 
A-8-A and A-B-B error means on each of the 
six days revealed significantly more A-B-A 
errors only on Day 1 , 1(22) = 4.706, £ < .001 . 
There were no significant differences on sub- 
sequent days . 



EXPERIMENT II 

For this experiment, the overall error rate 
was 11%, excluding errors made on "same" 
pairs. Only 22 errors were made on "same" 
pairs, for a 5% error rate. 

Errors were tabulated for each S in Group I 
for each day. Again, "same" errors were ex- 
cluded. A subjects x days repeated measures 
analysis of variance was performed on the 
data. There was significant improvement over 
days , F (1 , 11) = 7.10, £ < . 05 , after correc- 
tion for repeated measures. The Newman- 
Keuls method was again used to evaluate dif- 
ferences between the means for the 4 days. 
Again, only the difference between Day 1 and 
a 4 other days was significant (£ < .05). 

As in Experiment I, the error rate was cal- 
culated after eliminating Day 1 errors. The 
reduced error rate is 8.7% (again, excluding 
"same" errors) . 

Since Group II was only tested once, the 
errors for each S. were tabulated for Day 1. 

A 1 test was performed on the Day 1 errors 
from Group I and the errors from Group II. 
Although the mean number of errors for Group 
Ion Day 1 was 5.4 and the mean number of 
errors for Group II was 2.9, the difference 
was not significant, t(22) = 1 . 89, £>.05. 



Table 3 

Percent Errors as a Function of 
Contrasts, Types of Items, and Grade 



Voicing Contrasts 


Controlled-Vowel Contrasts 




K 


G1 


All Ss 




K 


G1 


All Ss 


Redundant 


12.5 


4.7 


8.6 


Redurdant 


15.1 


6,4 


10.2 


Minimal Initial 


IS. 3 


10.4 


14.9 


Minimal Initial 


19.6 


11,5 


15.5 


Minimal Final 


18.7 


10.9 


14.8 


Minimal Final 


20, 2 


15.4 


17.8 
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Table 4 

Error Rates for Individual Contrasts 



Controlled Vowel Length Contrasts 


Voicing Contrasts 


Contrast 


Error Rate 


Contrast 


Error Rate 


P - t 


24% 


p - b 


15% 


t - k 


12% 


t - d 


19% 


t - c 


24% 


b - g 


18% 


f - 0 


74% 


f - V 


26% 


0 - s 


37% 


0 - 3 


37% 


S - 9 


18% 


s - z 


3 i% 


b - d 


17% 


8 - Z 


3 2% 


d - g 


11% 


c -3 


24% 


d -3 


17% 






m * n 


47% 






v - * 


43% 






3 - z 


25% 






Z - 2 


33% 







Table 5 

Total Errors on ABA and ABB as a Function of Days and Grades 





dfO *17-4*7-4 
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IV 

DISCUSSION 



For Experiment I, the major results indi- 
cate: (1) performance improves significantly 
after the first day of testing; (2) repeated 
contrast pairs are easier to discriminate than 
either initial contrast pairs or final contrast 
pairs; (3) a small set of contrasts account for 
a large proportion of the errors. 

It was argued earlier that task variables 
have inflated discrimination measure! in 
previous studies. The improvement over days 
is strong support for this position. Obviously, 
Ss do not learn to discriminate phonemes be-, 
tween the first two sessions. The significant 
improvement may be attributed to an increased 
facility with the task, or to factors of atten- 
tion, or to both. 

One specific example of such improvement 
is the reduction of errors in response to A-B-A 
alternatives. On Day 1, there are significantly 
more A-B-A errors than A-B-B errors , This is 
in complete agreement with the findings of 
Briere (1967), However, there are no signifi- 
cant differences between A-B-A and A-B-B 
errors on any of the succeeding days. What 
Biiere attributed to limitations in auditory 
memory span seems to be merely the result of 
attcntional processes or some simple form of 
task learning. 

While the reduction of A-B-A errors from 
Day 1 to Day 2 is large, it does not account 
for all of the improvement in performance. 

Thus, it may be speculated that there is some 
more global mechanism generating improvement 
to which the reduction of errors on A-B-A pairs 
is related , 

Most earlier studies assumed that Ss were 
attending to the critical attributes of stimuli 
as a matter of course. In the present study, 
superior performance on repeated contrast 
pairs implies that Ss may not always receive 
the critical information to discriminate c6n- 
ventional minimal pairs, Repeated contrast 
pat s provide Ss more than one opportunity to 
obtain the necessary information, thus demand- 



ing less of the child in terms of attention. 
Consequently, a truer picture of discrimina- 
tion ability in children can be obtained, un- 
contamlnated by attentional factors. 

The finding that three contrasts (A/ - /0/, 
/v/ - /o/, /m/ - /n/) accounted for 43% of 
all errors in Experiment I is not unusual or 
surprising. Such a finding has been reported 
by other investigators (Travis and Rasmus, 

1931; Templiu, 1943; Skeel, Calfee, & 
Venezky, 19G9). It is predictable from a 
knowledge of the impoverished acoustic cues 
involved. The fricative contrasts, A/ - /fl/ 
and /v/ - /o/ are distinguished, not by their 
acoustic energy spectra, but only by formant 
transitions, in contrast to other fricatives 
(Delattre, Liberman, & Cooper, 1962). Fur- 
ther, the six phonemes in question are marked 
by low intensity. It may also be that the 
auditory presentation precludes using impor- 
tant visual cues normally present with these 
phonemes, since they all involve labia 1 or 
dental movements. 

Another finding of interest is that, overall, 
there was no difference in peiformance on 
final and initial contrast pairs. This is in 
contrast to earlier studies reporting that final 
pairs were significantly more difficult to dis- 
criminate (Templin, 1957). 

Where voicing contrasts are concerned, 
vowel duration as a cue is confounded with 
voicing as a cue for repeated and final pairs. 
Initial consonants, though, have no effect on 
the length of vowels in CVC‘ s (Peterson & 
Lehlste, 1960). Thus, the initial contrast 
pairs for voicing contrasts are, in effect, 
controlled vowel-length pairs, since both 
CVC's in a pair end with the same consonant. 
Initial voicing contrasts serve as an adequate 
control In determining the effect of vowel dura- 
tion as a cue. There was no difference be- 
tween initial and final contrast pairs in the 
analysis of the controlled vowel-length mini- 
mal pairs. To conclude that Ss use vcwel 
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duration as an additional cue in discrimina- 
tion, final contrast pairs must exhibit fewer 
errors than initial pairs in voicing contrasts. 
Since no significant difference was found, 
there is no indication that Ss make effective 
use of vowel duration in discriminating CVC's 
differing in the final consonant, 

The generality of the statement that per- 
formance improves after Day 1 is supported 
by the results of Experiment II using the 
Wepman Test of Auditory Discrimination. Al- 
though the task, materials, and presentation 
differ from those used in Experiment I, the 
finding with respect to repeated testing is 
the same. In addition, this finding also sup-* 
ports the notion that general task variables 
can and do inflate error rate in conventional 
testing of speech sound discrimination. 

The use of Group II, the same First Grade 
Ss previously tested in Experiment I, was an 
attempt to measure the amount of transfer 
from previous discrimination testing. Although 
the number of errors for Group II was 35 as 
opposed to 65 for Group I on Day 1 , the dif- 
ference was not statistically significant. 

Thus, while there does seem to be some evi- 
dence of carry-over, it is oerhaps overly 
optimistic to expect that it should be reliable 
after a 5-month Interval. 

The experiments reported above indicate 
the need for several important modifications 
in methodology for optimal assessment of 
phonemic discrimination in young children. 
Repeated testing is a necessity. Since stable 
performance is reached after Day 1, purer 
measures can only be obtained In subsequent 
sessions. No study in this area has ever 



discarded results from an initial testing ses- 
sion. Therefore, conclusions in previous 
studies are based on error rates inflated b/ 
confounding factors and only partially related 
to discrimination ability. 

The use of repeated contrast pairs in the 
present study represents an additional step 
toward attaining purer discrimination mea- 
sures. Researchers have relied on normal 
minimal pairs without questioning their va- 
lidity in testing young children. Since the 
same control over contrasting elements is 
available with repeated contrast pairs, their 
use will sharply increase the accuracy of 
item or S-related measures. 

Several other aspects of the testing format 
used in Experiment I may prove to be desirable 
modifications of speech discrimination tasks. 
First, taped presentations permitted control 
of items across S^s , a precaution particularly 
necessary in diagnostic use of tests. Sec- 
ond, the choise of CVC nonsense syllables 
avoided the potential problems of familiarity 
inherent in the use of real words . Finally, 
the A-B-X paradigm required minimal instruc- 
tion in comparison to same-different tests; 

S^s 1 responses were not dependent on concepts 
of "same sounds" or "different sounds," 

The value of future research in the area of 
phonemic discrimination in children seems 
dependent on improvement of the testing in- 
strument; repeated testing and the use of re- 
peated contrasts to control for fluctuating 
attention would appear to be important steps 
in this direction. Certainly, past research 
must be re-evaluated in light of the present 
experimental findings. 
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APPENDIX 



SUMMARY OF PROCEDURES USED IN ATTEMPTS TO IMPROVE 
CONVENTIONAL TESTING 



In the first experiment, 25 Kindergarten 
pupils attending a public school in Madison, 
Wisconsin, served as Ss> whose ages ranged 
between 5 and 6 years, Thirty contrast pairs 
were constructed, All contrasts were initial 
consonant contrasts in CVC syllables (e,g., 
if/ - /sif/). Place of articulation, manner 
of articulation, and voicing were manipulated. 
Stops, fricatives, and affricates were in- 
cluded, Each S received all 30 contrasts. 

An MTA scholar teaching machine was modi- 
fied for presentation of the test items . A de- 
scription of the technique employed can be 
found in Skeel, Calfee, and Venezky (1969). 
The child faced a visual display while the 
test items were presented in an A-B-X para- 
digm. S, responded by pushing a panel that 
represented the source of the correct utterance. 

In the second experiment, 5 7 Kindergarten 
pupils between 5 and 6 years of age served as 
Ss . All attended a public school in Madison, 
Wisconsin. The only controlled difference 
between this experiment and the first is the 
position of the contrasting elements in the 
CVC syllable, In this study, only final con- 
trasts were tested. 

For the third experiment, 97 Kindergarten 
pupils attending public schools in Madison, 
Wisconsin, served as S s . Their ages ranged 
between 5 and 6 years. A list of 20 minimally 
distinct CVC nonsense syllables was con- 
structed. Both initial and final contrasts were 
included in the test. Stops, fricatives, af- 
fricates, and nasals {/m/ and /n/) comprised 
the contrasting elements. The technique using 
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the modified teaching machine was abandoned 
in this experiment. An Ampex 1100 stereo 
tape recorder was used in presenting the 
stimulus material. Left and right speakers 
were placed about six feet apart in front of 
S , who sat mid-way between them. A warn- 
ing signal was heard on the left speaker, 
immediately followed by the first member of 
the contrast pair. Approximately one second 
later, the second member of the pair was pre- 
sented over the right speaker. One second 
later M who said X M was presented over both 
speakers. Ss responded simply by pointing 
to the appropriate speaker. After a three sec- 
ond interval which allowed time for the re- 
sponse, "I said X" was heard over the appro- 
priate speaker. A warning signal initiated the 
next cycle. 

For the final experiment, 1 7 Ss, ranging 
in age from 3 years 11 months to 5 years 1 
month, were presented a total of 90 CV and 
VC contrast pairs over a 7-week period. Con- 
trast pairs consisted of six fricatives (/f /, 

/v/ r /0/, /3/, /s/, and /z/), each con- 
trasted with each of the other five, initially 
and finally, and in combination with three 
syllable nuclei {/a/, /i /, and /ai/), In the 
first week, Ss were given a pretraining task, 
For each of the subsequent six weeks, 15 con- 
trast pairs were presented along with seven 
control pairs (Interspersed In the list). These 
control items remained constant for all ses- 
sions. The modified MTA technique, referred 
to earlier In the description of the first experi- 
ment, was used . 






